Computationally efficient sibship and parentage assignment from multilocus marker data.

Author

  • Jinliang Wang
Abstract

Many methods have been proposed to infer sibship and parentage among individuals from their multilocus marker genotypes. They are all based on Mendelian laws, either qualitatively (exclusion methods) or quantitatively (likelihood methods), but they have different optimization criteria and use different algorithms to search for the optimal solution. The full-likelihood method assigns sibship and parentage relationships among all sampled individuals jointly. It is by far the most accurate method, but is computationally prohibitive for large data sets with many individuals and many loci. In this article I propose a new likelihood-based method that is computationally efficient enough to handle large data sets. The method scores the plausibility of a configuration by the sum of the log likelihoods of the pairwise relationships it contains, where the log likelihoods of pairwise relationships are calculated only once and stored for repeated use. By analyzing several empirical and many simulated data sets, I show that the new method is more accurate than pairwise likelihood and exclusion-based methods, but slightly less accurate than the full-likelihood method. However, the new method is computationally much more efficient than the full-likelihood method, and when both sexes are polygamous and markers have genotyping errors it can be several orders of magnitude faster. The new method can handle a large sample with thousands of individuals, with the number of markers limited only by computer memory.
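The scoring idea in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the table `PAIR_LOGLIK`, its numbers, the function `configuration_score`, and the restriction to two relationship classes (full sibs and unrelated) are all assumptions made for illustration, and the candidate configurations are simply enumerated rather than found by a search algorithm.

```python
from itertools import combinations

# Hypothetical pairwise log likelihoods, computed once and stored.
# Key: (individual_a, individual_b, relationship), where relationship is
# "FS" (full sibs) or "UR" (unrelated). The numbers are made up.
PAIR_LOGLIK = {
    ("a", "b", "FS"): -2.0, ("a", "b", "UR"): -5.0,
    ("a", "c", "FS"): -6.0, ("a", "c", "UR"): -3.0,
    ("b", "c", "FS"): -7.0, ("b", "c", "UR"): -3.5,
}

def configuration_score(clusters, table=PAIR_LOGLIK):
    """Sum the stored pairwise log likelihoods implied by a full-sib partition."""
    # Map each individual to the index of the family (cluster) it belongs to.
    family_of = {ind: i for i, members in enumerate(clusters) for ind in members}
    score = 0.0
    for x, y in combinations(sorted(family_of), 2):
        # Same cluster implies full sibs; different clusters imply unrelated.
        rel = "FS" if family_of[x] == family_of[y] else "UR"
        score += table[(x, y, rel)]  # lookup only, no recomputation
    return score

# Three candidate configurations for individuals a, b, c (assumed for the demo).
candidates = [
    [["a", "b"], ["c"]],
    [["a", "b", "c"]],
    [["a"], ["b"], ["c"]],
]
best = max(candidates, key=configuration_score)
```

Because every candidate is scored from the same stored table, evaluating a new configuration costs only lookups and additions, which is the source of the efficiency the abstract describes.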


Related resources

Parentage and sibship inference from multilocus genotype data under polygamy.

Likelihood methods have been developed to partition individuals in a sample into sibling clusters using genetic marker data without parental information. Most of these methods assume either both sexes are monogamous to infer full sibships only or only one sex is polygamous to infer full sibships and paternal or maternal (but not both) half sibships. We extend our previous method to the more gen...


Effective number of breeders from sibship reconstruction: empirical evaluations using hatchery steelhead

Effective population size (Ne) is among the most important metrics in evolutionary biology. In natural populations, it is often difficult to collect adequate demographic data to calculate Ne directly. Consequently, genetic methods to estimate Ne have been developed. Two Ne estimators based on sibship reconstruction using multilocus genotype data have been developed in recent years: sibship ass...


Reliable effective number of breeders/adult census size ratios in seasonal‐breeding species: Opportunity for integrative demographic inferences based on capture–mark–recapture data and multilocus genotypes

The ratio of the effective number of breeders (Nb) to the adult census size (Na), Nb/Na, approximates the departure from the standard capacity of a population to maintain genetic diversity in one reproductive season. This information is relevant for assessing population status, understanding evolutionary processes operating at local scales, and unraveling how life-history traits affect these pr...


Genetically reconstructed pedigrees: The costs and benefits of using full-sibling structure to constrain parentage assignments

We present a simple yet effective method to improve parentage assignment (PA) accuracy an average of 47% compared to the PA programs PEDAPP (39%), PASOS (53%), and CERVUS (50%) as measured over a wide range of simulated scenarios. The method, termed sibship constraint (SC), uses the results of sibship reconstruction (SR) to constrain assignments from PA output. It works by ass...


Short tandem repeat-based identification of individuals and parents.

Estimation of short tandem repeat (STR) multilocus genotype frequencies for the identification of individuals and estimation of allele frequencies for parentage assignment both depend on (a) testing a lot of loci, (b) high levels of polymorphism at each locus tested, and (c) independence among alleles. Independence is critical, because the estimation of multilocus genotype and gamete frequencie...



Journal title:
  • Genetics

Volume 191, Issue 1

Pages -

Publication date: 2012